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Abstract 

Background: In the early stage of eukaryotic DNA replication, the template DNA is unwound by the MCM 
helicase, which is activated by forming a complex with the Cdc45 and GINS proteins. The eukaryotic GINS forms a 
heterotetramer, comprising four types of subunits. On the other hand, the archaeal GINS appears to be either a 
tetramer formed by two types of subunits in a 2:2 ratio (a 2 p2) or a homotetramer of a single subunit (a 4 ). Due to 
the low sequence similarity between the archaeal and eukaryotic GINS subunits, the atomic structures of the 
archaeal GINS complexes are attracting interest for comparisons of their subunit architectures and organization. 

Results: We determined the crystal structure of the a 2 p2 GINS tetramer from Thermococcus kodokoroensis 
(7/coGINS), comprising Gins51 and Gins23, and compared it with the reported human GINS structures. The 
backbone structure of each subunit and the tetrameric assembly are similar to those of human GINS. However, the 
location of the C-terminal small domain of Gins51 is remarkably different between the archaeal and human GINS 
structures. In addition, 7/coGINS exhibits different subunit contacts from those in human GINS, as a consequence of 
the different relative locations and orientations between the domains. Based on the GINS crystal structures, we 
built a homology model of the putative homotetrameric GINS from Thermoplosmo ocidophilum (TacGINS). 
Importantly, we propose that a long insertion loop allows the differential positioning of the C-terminal domains 
and, as a consequence, exclusively leads to the formation of an asymmetric homotetramer rather than a 
symmetrical one. 

Conclusions: The DNA metabolizing proteins from archaea are similar to those from eukaryotes, and the archaeal 
multi-subunit complexes are occasionally simplified versions of the eukaryotic ones. The overall similarity in the 
architectures between the archaeal and eukaryotic GINS complexes suggests that the GINS function, directed 
through interactions with other protein components, is basically conserved. On the other hand, the different 
subunit contacts, including the locations and contributions of the C-terminal domains to the tetramer formation, 
imply the possibility that the archaeal and eukaryotic GINS complexes contribute to DNA unwinding reactions by 
significantly different mechanisms in terms of the atomic details. 



Background 

DNA replication is an essential event for all living 
organisms, and thus the basic mechanism is conserved 
from bacteria to eukaryotes. Genomic DNA replication 
must be executed accurately and only once during the S 
phase of the cell cycle. Rapid and accurate DNA replica- 
tion requires the assembly of a large number of proteins, 
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termed the replisome, which directs major reactions, 
such as origin recognition, template DNA unwinding, 
and primer extension. Accumulating evidence has iden- 
tified the essential proteins for DNA replication, which 
have helped to provide a better understanding of the 
complex puzzle of DNA replication [1]. For example, in 
eukaryotes, a heterohexamer composed of the Mcm2- 
Mcm7 subunits works as the MCM helicase to unwind 
the template DNA, but this helicase activity is very low 
in vitro [2]. While Schwacha's group demonstrated the 
significant helicase activity of yeast Mcm2-7 alone [3,4] , 
extra protein factors, which enhance the MCM helicase 
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activity, have been found, and in particular, Cdc45, 
MCM, and GINS form a complex called the unwindo- 
some or the CMG complex, which plays an essential 
role in the template DNA unwinding reaction, and thus 
this complex is considered to be the functional replica- 
tive helicase [5-8]. GINS, named from the Japanese go- 
ichi-ni-san, meaning 5-1-2-3, is a heterotetramer in 
eukaryotes, comprising the Sld5, Psfl, Psf2, and Psf3 
subunits, and is essential for both initiation and elonga- 
tion in DNA replication [9-11]. GINS is recruited to the 
replication origin with Dpbll and Sld3 for the initiation 
and may functionally interact with DNA polymerase s 
[9]. A functional interaction between human GINS and 
DNA polymerase a/primase complex is also reported 
[12], indicating its important role to move from initia- 
tion to elongation of the DNA replication. GINS actually 
forms a transient complex, designated as the preloading 
complex (pre-LC), with DNA polymerase s, Sld2, and 
Dpbll, in a cyclin-dependent kinase (CDK)-dependent 
manner in budding yeast [13]. 

In 2007, three research groups reported the crystal 
structures of human GINS. Together, they showed a 
rigid and stable tetrameric core structure, with multiple 
flexible surface regions that are important for functional 
interactions with other DNA replication proteins 
[14-16]. Each subunit of human GINS consists of an a- 
helical main domain and a P -stranded small domain, 
and assembles into the heterotetramer, which exhibits a 
unique trapezoidal or ellipsoidal shape. Interestingly, the 
four subunits share a similar fold, in spite of the low 
amino acid sequence identity of at most 15%. Further- 
more, they are classified into two groups. One group, 
including human Sld5 and Psfl, possesses the a-helical 

(A) domain at the N-terminus and the P -stranded one 

(B) at the C-terminus (this arrangement is called AB- 
type), and the other group, comprising human Psf2 and 
Psf3, is the permuted version (BA-type; Figure 1A) 
[17-19]. 

It is well known that the DNA metabolizing proteins 
from archaea are similar to those from eukaryotes, in 
both structure and function [20]. Furthermore, the 
archaeal DNA replication system appears to be a simpli- 
fied version of the eukaryotic one. Therefore, structural 
studies of the archaeal system could be beneficial to 
understand the more complex DNA replication mechan- 
ism of eukaryotes. Bioinformatics analyses using geno- 
mic information from archaea and eukaryotes suggested 
that the archaeal GINS complexes are simplified ver- 
sions of the eukaryotic complexes. The archaeal com- 
plexes are probably either 012P2 type tetramers 
composed of the two types of subunits or a 4 type 
homotetramers of a single subunit, suggesting that the 
four different eukaryotic GINS subunits had diverged 
from a common ancestor [17]. The functional tetrameric 



assembly of the archaeal GINS has been proved by bio- 
chemical analyses. An archaeal GINS homologue was 
first identified in Sulfolobus solfataricus, as a binding 
partner of MCM [18]. The GINS complex from Pyrococ- 
cus furiosus increases the cognate MCM helicase activ- 
ity, in a similar manner to eukaryotic GINS, in spite of 
the lack of a eukaryotic Cdc45 homolog [21]. Due to the 
low sequence similarity between the archaeal and eukar- 
yotic GINS subunits, information about the three- 
dimensional structure of archaeal GINS complexes is 
crucial to obtain clearer insights into the functional sub- 
unit assembly. 

In this paper, we report the crystal structure of the 
OC2P2 GINS tetramer from Thermococcus kodakaraensis 
(T/coGINS), which consists of the Gins51 and Gins23 
subunits. This first crystal structure of an archaeal GINS 
was compared with that of human GINS. The overall 
fold of each subunit and the tetrameric assembly of Tko- 
GINS are essentially similar to those of human GINS. 
However, the locations of the C-terminal small domains 
are strikingly different between the two GINS, thus 
resulting in the formation of different subunit contacts 
in the complexes. Based on the two GINS crystal struc- 
tures, we built a homology model of the putative homo- 
tetrameric GINS from Thermoplasma acidophilum 
(TacGINS), and found that TacGINS could form the 
tetrameric subunit structure, in a similar manner to the 
a 2 p2 TkoGYNS tetramer and the human GINS heterote- 
tramer. These results suggest that the basic function of 
GINS in DNA replication is conserved across the 
domains of life. 

Results and Discussion 

Structure of 77coGINS 

The crystal structure of TkoGlNS was determined by 
the multiple isomorphous replacement method with 
anomalous dispersion (MIRAS), and the atomic model 
was refined at 2.65 A (Table 1). The crystal contains 
one pair of the Gins51 and Gins23 subunits in the 
asymmetric unit, and the protein forms the 012P2 tetra- 
mer generated by the crystallographic two-fold symme- 
try operation (Figure IB). 

Gins51 is composed of the larger a-helical domain at 
the N-terminus, and the smaller p -stranded domain at 
the C-terminus. In the Gins51 subunit, the long linker 
region between the two domains is disordered in the 
crystal, and 16 residues could not be assigned in the 
electron density map. Therefore, although we built an 
atomic model with the C-terminal domain contacting 
the nearest N-terminal one, the positions of the C-term- 
inal domains may be swapped between the two Gins51 
subunits to those in the tetrameric structure. Gins23 
also possesses similar structural domains as in Gins51, 
but the order of the two domains in the sequence is 
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Figure 1 Crystal structure of Thermococcus kodakaraensis GINS. (A) Domain organization of GINS from human, Thermococcus kodakaraensis, 
and Thermoplasma acidophilum. Cylinders represent A-domains mainly composed of a-helices and arrows show B-domains rich in p-strands. 
This coloring is used throughout the manuscript. The B domain of human Psfl is missing in the crystal structures. (B) Ribbon representation of 
the structure. The Gins51 subunits are colored green, and the Gins23 subunits are cyan. Missing parts are shown with dotted lines. The 
crystallographic two-fold axis is indicated. The figure was prepared with PYMOL http://www.pymol.org. 



permutated. Each domain of Gins23 can be separately 
superimposed on that of Gins5L The Gins23 N- terminal 
domain superimposed on the Gins51 C-terminal one 
with a root mean square deviation (RMSD) of 0.81 A, 
using the corresponding 38 Ca atoms, and the Gins23 
C-terminal domain superimposed on the Gins51 N- 
terminal one with an RMSD of 1.36 A, using 89 Ca 
atoms. 



Architectural comparison between the T. kodakaraensis 
and human GINS complexes 

The Tko GINS tetrameric structure is similar, in terms 
of shape and size, to the human GINS complex, com- 
prising the four different subunits, Sld5, Psfl, Psf2 and 
Psf3 [14-16]. While both crystal structures contain a 
narrow central hole, an electron microscopic analysis of 
human GINS revealed a horseshoe-shaped structure 
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Table 1 X-ray diffraction data collection and refinement 



Data Collection Summary 







P)pri\/3ti\/P 
L^CI 1 vci Live 








Native 


i C15U1 14 


SeMet 


K 2 PtCI 4 


Wavelength (A) 


1 .0000 


1 .2544 


0.9795 


1.0717 


Resolution (A) 


50.0-2.65 


50.0-3.16 


50.0-2.80 


50.0-3.19 


(Highest shell) 


(2.74-2.65) 


(3.27-3.16) 


(2.90-2.80) 


(3.30-3.19) 


Measured reflections 


227882 


127012 


181671 


112023 


Unique reflections 


16278 (1598) 


9676 (901) 


13780(1256) 


9034 (921) 


Completeness (%) 


99.4 (100.0) 


97.6 (95.1) 


99.1 (92.8) 


94.5 (99.9) 


IMf) 


17.6 (9.9) 


23.8 (11.4) 


14.5 (7.1) 


14.2 (7.4) 


Redundancy 


14.0 (14.0) 


13.1 (13.5) 


13.2 (9.7) 


12.4 (9.3) 


Emerge (%) 


4.3 (38.8) 


5.6 (23.0) 


7.5 (33.9) 


6.7 (38.6) 


MIRAS Phasing Statistics 


/?iso(f) (%) 




12.4 


21.1 


17.1 


Number of Sites 




2 


6 


1 


Resolution (A) 




50.0-4.0 


50.0-4.0 


50.0-4.0 


Phasing Power (Centric/Acentric) 




0.56/0.57 


0.64/0.54 


0.63/0.62 


Figure of merit (Centric/Acen.) 


0.32/0.36 









Refinement 






Resolution 
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(%) 
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Number of atoms 
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Protein 
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Bond Lengths 
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0.01 
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0 
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data excluding data used to calculate R free . 


c /? free was calculated using 5°/c 


> of the total reflections, which were chosen randomly and omitted from the refinement. 



with a large central hole [22]. It is tempting to speculate 
that GINS undergoes a conformational change between 
the open and closed forms, and the crystal structure 
exhibits a closed state. As expected, the spatial location 
of the r/coGins51 subunit in the tetramer corresponds 
to those in human Sld5 and Psfl, and that of 77<oGins23 
corresponds to those of Psf2 and Psf3 (Figure 2A). Tko- 
Gins51 from the upper layer superimposed onto human 
Sld5 and Psfl with RMSDs of 1.7 A and 2.3 A, using 87 
and 85 Ca atoms in the A domains, respectively (Figure 
2B, upper line). In the case of the lower layer subunits, 
the two Gins23 subunit structures are entirely similar to 
their human counterparts, including the relative loca- 
tions between the two domains. Indeed, 27<oGins23 
superimposed onto Psf2 with an RMSD of 2.1 A using 
93 Ca atoms, and onto Psf3 with an RMSD of 1.6 A 
using 82 Ca atoms. These structural similarities 



highlight the evolutionary conservation between the 
archaeal and eukaryotic GINS complexes (Figure 2B, 
lower column). Notably, Swiatek and MacNeill reported 
the structural similarity between the B domains of the 
human GINS proteins and the C-terminal domain 
(CTD) of the primase small subunit (PriS)-CTD) from 
Sulfolobus solfataricus [23]. The B domains of the Tko- 
GINS subunits are also similar to PriS-CTD [see Addi- 
tional file 1], supporting an interesting hypothesis that 
archaeal PriS acquired its CTD by straightforward 
sequence duplication. 

Despite the similarities in the internal architecture of 
each subunit and the overall morphology of the tetra- 
meric subunit assembly, notable differences are observed 
between the archaeal and human GINS complexes. 
Firstly, the locations of the C-terminal B domains are 
strikingly different between the archaeal Gins51 and the 
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(A) 71oGINS 

Gins51 
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Gins51 vs. Sld5, rmsd: 1.7 A (87a.a.) Gins51 vs. , rmsd: 2.3 A (85a.a.) 




Gins23 vs. Psf2, rmsd: 2. 1 A (93a.a.) Gins23 vs. , rmsd: 1 .6 A (82a.a.) 

Figure 2 Comparison of the archaeal and human GINS complexes. (A) Comparison of the tetrameric structures between the Thermococcus 
kodokoraensis and human GINS complexes. (B) Comparison of the corresponding subunits. 



human Sld5. In the superimposed structures, the Tko- 
Gins51 B domain is located about 30 A away from that 
of human Sld5 (Figure 2B). In the human complex, the 
B domain of Sld5 contributes to the stable tetramer for- 
mation, and the observed location in the crystal struc- 
ture is in fact important for its function [15]. 

On the other hand, the corresponding domain of 
human Psfl is highly mobile. Kamada et al. and Choi et 
al. crystallized the GINS proteins lacking the Psfl B 
domain [14,15]. Although Chang et al. solved the crystal 
structure of the full-length GINS, they could not 



observe the electron density for the Psfl B-domain in 
the crystal [16]. Thus, to determine whether the Gins51 
B domain is required for the stable tetramer formation, 
we examined the oligomeric state of a TkoGYNS trunca- 
tion mutant, lacking the B domain of Gins51, by gel fil- 
tration. As shown in Figure 3A, this truncation mutant 
forms a stable tetramer, establishing that the C-terminal 
B domain of Gins51 is not essential for stable tetramer 
formation. Notably, the isolated Gins51 subunit forms a 
dimer irrespective of the presence or absence of the C- 
terminal B domain, and the isolated Gins23 also eluted 
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Figure 3 Gel filtration chromatography of TTcoGINS complexes. 

(A) Elution profiles of the wild type (thin line) complex and a 
truncation mutant (thick line) complex lacking the Gins51 C-terminal 
domain (Gins51AC: Met1-Arg130) (upper part). The elution positions 
of the marker proteins, thyroglobulin (670 K), immunoglobulin G 
(158 K), ovalbumin (44 K), and myoglobin (17 K), are shown by 
arrows on the top. The elution volume (ml) for each molecule is 
indicated on the top with the apparent molecular mass shown in 
parenthesis. Aliquots of each fraction were subjected to 12.5% SDS- 
PAGE analysis followed by Coomassie Brilliant Blue staining (lower 
part). (B) Gel filtration chromatography and SDS-PAGE of the 
isolated Gins51, Gins23 and Gins51AC. 



as a dimer (Figure 3B). It is presently unclear whether 
the 27<oGins51 B domain occupies a functional position, 
as in the corresponding domain of human Sld5, or is 
incidentally fixed there, due to crystal packing [see 
Additional file 2]. However, the mobility of this B 
domain is conserved from archaea to human, and hence 
it seems likely that these movements of the B domain 
are required for the function of GINS complexes. 

Comparison of subunit contacts 

A second difference was observed in the subunit con- 
tacts, particularly between the upper and lower layers 
(Figure 4). Overall, the archaeal Gins51-Gins51 (upper- 
upper) and Gins23-Gins23 (lower-lower) interactions 




Tko Gins51-Gins23 


Human: Sld5-Psf2 


Human: Psfl-PsO 


{ Q9maB\-K\ 


f Sld5-A >v 
/sid5-B\ r^l^r^\ 
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\G1mi23-A J 
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Figure 4 Subunit contacts in the Thermococcus kodakaraensis 
and human GINS complexes. Close-up views of the upper-lower 
layer subunit contacts, shown by stereo pairs. The conserved 
intermolecular helix-helix contacts observed in the first area are 
indicated by red arrows, and those unique in each upper-lower 
subunit pair in the second area are indicated by blue arrows. The 
conserved tryptophan residues are depicted by stick models. (A) T. 
kodakaraensis Gins51-Gins23. (B) Human Sld5-Psf2 and (C) Psf1-Psf3. 
(D) Schematic drawing of the subunit contacts. 



are similar to those of the corresponding contacts, Sld5- 
Psfl and Psf2-Psf3, of the human GINS, respectively. 
They all exhibit a similar intermolecular four a-helix 
bundle structure, in which two a -helices from each sub- 
unit contribute to the contact. On the other hand, the 
contact between Gins51 and Gins23 (upper-lower; Fig- 
ure 4A) is strikingly different from the corresponding 
Sld5-Psf2 (Figure 4B) or Psfl-Psf3 (Figure 4C) 
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interactions of human GINS. This difference could be 
attributed to the relative orientations between the upper 
and lower subunits. 

The contact between the upper and lower layer subu- 
nits could be divided into two areas: one involves the A 
domain from the upper layer subunit and the B domain 
from the lower layer subunit (the right side in Figure 4), 
while the other involves the A domain from the lower 
layer subunit. In TkoGINS, the first contact area is 
formed mainly between the Aa3 helix of Gins51 and 
the prominent Ba2 helix of Gins23, and additionally 
between the Aal helix of Gins51 and the N-terminal 
loop of Gins23 (Figure 4A). We found that the intermo- 
lecular helix-helix interaction is similarly observed in 
the human GINS structures, although the detailed con- 
tact modes, that is, the amino acid residues, contributing 
to the contact, are different between them [see Addi- 
tional file 3]. However, a tryptophan residue is notably 
conserved among TkoGins23 (Trp30) and human Psf2 
(Trp47) and Psf3 (Trp71) in this area (Figures 4 and 
5A). Our preliminary analysis suggested that this trypto- 
phan residue is conserved among the lower layer subu- 
nits from eukaryotes and archaea (Data not shown). 
Therefore, it is tempting to speculate that this contact 
may be a key factor to generate the similar tetramer for- 
mation between the archaeal and human GINS 
complexes. 

On the other hand, the upper-lower layer subunit con- 
tact in the second area varies among the three interac- 
tions, in terms of the relative orientations between the 
upper and lower subunits. 77<0Gins23 is located close to 
Gins51, allowing the C-terminal Aa5 helix to interact 
with the Aal helix of Gins51 (Figure 4A). In the corre- 
sponding area of the human Sld5-Psf2 interaction, the 
C-terminal A domain of Psf2 is farther away from Sld5, 
as compared to 77<oGINS. Instead, it is the B domain of 
Sld5 that faces Psf2, to form a broad contact surface 
(Figure 4B). This could be the reason why the Sld5 B 
domain in human GINS is required for stable complex 
formation [14]. Conversely, the corresponding B domain 
of 27<oGins51 may be dispensable for tetramer forma- 
tion, because Gins51 and Gins23 can interact with each 
other without the Gins51 B domain. In addition, the 
corresponding interaction is missing between human 
Psfl and Psf3, due to the high mobility of the B domain 
of Psfl and the distance between the C-terminal A 
domains of Psf3 and Psfl. Instead, the N-terminal loop 
of Psf3 contacts the A domain of Psfl, but this segment 
is reportedly dispensable for tetramer formation [16]. 
Notably, I7coGins23 lacks the corresponding segment at 
the N- terminus (Figures 4C and 5A). 

The observed similarities and differences between the 
two-fold symmetric TkoGINS and the asymmetric 
human GINS may reflect the different evolutional 



processes from the last universal common ancestor. 
Archaea may have kept the simple subunit composition 
of GINS, which functions better in their simple DNA 
replication system. On the other hand, eukaryotes prob- 
ably required the heterotetrameric GINS so that GINS 
can play multiple roles in their highly coordinated 
systems. 

Homology modeling of the TacGINS homotetramer 

The 012P2 TkoGINS and aa/pp' human GINS complexes 
share a similar tetrameric assembly, in spite of the 
above-mentioned differences in the subunit contacts, 
implying that this conserved architecture is essential for 
the GINS functions. Thus, there is a strong interest in 
the structure of another type of GINS, that is, an 
archaeal a 4 homotetrameric GINS. Gel filtration and 
electron microscopic analyses showed that TacGlNS 
indeed forms a homotetramer (Ogino et al, data not 
shown). In the absence of the three-dimensional struc- 
ture of TacGlNS, we performed homology modeling 
using the present crystal structure of TkoGINS. 

The crystal structures of the TkoGINS tetramers 
revealed a stack of a two-fold symmetric dimer of the 
AB-type subunits (27coGins51) onto another two-fold 
symmetric dimer of BA-type subunits (27roGins23). The 
geometric transformation linking both homodimers 
requires a simple translation without any rotational 
component. This is also the situation in the human 
GINS complex, although in that complex, the homodi- 
mers are heterodimers. In both cases (TkoGINS and the 
human GINS), the open interfaces at the top and bot- 
tom of the tetramer are different from those used for 
dimer stacking. This asymmetry is the reason why 
another dimer does not pile up on top of the tetramer. 
However, if TacGlNS forms a similar tetramer, com- 
posed of a single subunit, then there is no reason why it 
might not polymerize to form a fiber-like assembly. 
Indeed, in this case, the interface used for dimer-dimer 
stacking would also exist at the top and bottom of the 
tetramer and would allow other dimeric modules to 
assemble. The second problem is the location of the 
small B domain in the TacGlNS homotetramer. All of 
the GINS subunits are composed of an A domain and a 
B domain, but the two domains can be permutated. 
Sld5 and Psfl of human GINS and Gins51 of TkoGINS 
are the AB-type, with the A domain at the N-terminus 
and the B domain at the C-terminus. The location of 
the two domains in the sequence is opposite in human 
Psf2 and Psf3 and 77<oGins23, forming the BA-type. A 
structure-based sequence alignment (Figure 5A) revealed 
that TacGins is an AB-type subunit. Thus, the question 
arises as to how the authentic GINS tetramer can be 
formed with the only AB-type subunit in the I^cGINS 
a 4 tetramer. 
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Figure 5 Homology model of the Thermoplasma acidophilum GINS homotetramer. (A) Structure-based sequence alignment of GINS 
subunits of Homo sapiens (Hs), Thermococcus kodokoroensis (Tk), and Thermoplasma acidophilum (Ta). Amino acid residues are colored according 
to their characteristics (aromatic, cyan; hydrophobic, green; basic, magenta; ambivalent, orange). Secondary structural elements are indicated on 
the top of the sequences. Numbering of a-helices and (3-strands is according to the human GINS subunits [15]. (B) Two types of models in 
different conformations, the AB- and BA-types, are shown on the top and bottom, respectively. For the models, the A domain (residues 1 to 123, 
green in AB-type and blue in BA-type), the ID-loop (residues 124 to 172, white in both models), and the B domain (residues 173 to 233, red in 
AB-type and orange in BA-type) are differently colored. (C) A model of the TacGINS homotetramer. (D) Class averages of electron microscopic 
images of the TacGINS complex. The two-dimensional class averages of the EM images of the TacGINS complex (upper row). Similar projections 
(middle row) were obtained from the assumed atomic model. The atomic models of the TacGINS tetramer in the corresponding orientations are 
shown (lower row). The side length of the individual images in the upper and middle rows is 13.2 nm. 



The flexibility in the spatial position of the B domain 
relative to the A domain, which was revealed by the 
comparison of TkoGINS and human GINS, and the dis- 
order of the interdomain loop observed in the TkoGYNS 



structure could provide a reasonable solution for these 
problems. The solution assumes two types of structural 
models for the T^cGins subunits (Figure 5B), which 
could be possible by using the long (nearly 40 amino 
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acid residues) interdomain loop. This loop is much 
longer than those of the other homologous subunits and 
is rich in glutamic acid, glutamine, and proline residues, 
and therefore is likely to form an intrinsically disordered 
(ID) structure (ID-loop in Figure 5A). Figure 5C shows 
a possible homotetramer model of TacGINS. This tetra- 
mer model appears to simultaneously solve the above- 
mentioned two problems. On the upper layer, we tenta- 
tively placed the B domains of TacGins in a similar 
orientation to those in the 27<oGins51 subunits (Figure 
5B). It should be noted that other locations are possible, 
because as seen in the two GINS crystal structures, the 
B domains of the AB-type subunits could be mobile. For 
example, human GINS was crystallized by truncating 
the mobile B domain of Psfl [14,15], and in T/coGINS, 
the Gins51 B domain is not required for stable tetramer 
formation (Figure 3). However, regardless of the location 
of the B domains, the ID-loop should protrude above 
the two domains, which could prevent the stacking of 
another dimer on the top interface (Figure 5B, AB-type). 
On the other hand, on the lower layer, we constructed a 
model in which the AB-type TacGins mimicked the 
domain positioning of the BA subunit (Figure 5B, BA- 
type). The ID-loop is long enough for this domain posi- 
tioning. Taken together, the interdomain ID-loop of 
TacGins could play two roles in the homotetramer: to 
block non-functional polymerization on the tetramer 
surfaces, and to form a long connector to allow large 
domain repositioning. In other words, we propose that 
this long ID-loop allows the differential positioning of 
the B-domains and, as a consequence, exclusively leads 
to an asymmetric homotetramer, rather than a symme- 
trical one. 

To experimentally validate the predicted TacGINS tet- 
ramer model, we performed an electron microscopic 
analysis of TacGINS (Figure 5D). The two-dimensional 
class averages thus obtained exhibited various shapes, 
according to the different orientations of the complex. 
Interestingly, a number of these class averages were very 
similar to the projections simulated from the predicted 
tetramer model. 

Conclusions 

We determined the 2.65 A resolution crystal structure 
of the first archaeal GINS complex from T kodakaraen- 
sis, which forms a a 2 (3 2 heterotetramer composed of the 
Gins51 and Gins23 subunits. The overall tetrameric 
structure of the archaeal GINS is similar to those of the 
reported human GINS complexes, but the locations of 
the small domains and the contact modes between the 
subunits are different between the archaeal and human 
GINS complexes. The gel filtration experiment revealed 
that the B domain of 77<oGINS is not required for tetra- 
mer formation. This mobility is similar to the 



corresponding domain of Psfl of human GINS, but dif- 
ferent from the other corresponding domain of Sld5 of 
human GINS, which is required for stable tetramer for- 
mation. These differences imply that minor variations 
exist in the interactions with other DNA metabolizing 
proteins, in terms of the atomic details. Next, we built a 
reasonable model of the T acidophilum GINS homote- 
tramer, which has a highly similar structure to those of 
the T kodakaraensis and human GINS crystal struc- 
tures. Taken together, these structural analyses suggest 
that the GINS function is essentially conserved between 
archaea and human. Considering this important view- 
point, an archaeal homolog of the eukaryotic Cdc45 
protein, which would participate in a replicative helicase 
complex similar to that of eukaryotes, could exist, 
although it has not been identified yet, presumably 
because of its highly divergent sequence. The present 
structure and the homology model of archaeal GINS 
will provide a structural basis for clarification of the 
GINS functions, including the formation of the CMG 
unwindosome complex. 

Methods 

Protein expression and purification 

The recombinant TkoGlNS complex was produced by 
co-expression of the two genes for Gins51 and Gins23 
on the different expression vectors, using the conven- 
tional Escherichia coli expression system. The ginsSl 
and gins23 genes were cloned into pET21a (Amp r ) and 
modified pET28 (Km 1 ), respectively. These two plasmids 
were introduced into E. coli codonPlus™ (DE3)-RIL 
cells (Stratagene, La Jolla, CA, USA) forcibly against 
incompatibility, and the cells containing the two plas- 
mids were selected on the agar plate containing both 
kanamycin and ampicillin. The transformation efficiency 
was low, but practical for further experiments. The 
modification of the pET28a is the replacement of the 
thrombin recognition sequence with the Tabaco etch 
virus (TEV) protease protease recognition sequence for 
removal of the His-tag region at the N-terminus of the 
target protein. The cells were grown in Luria Broth (LB) 
medium at 37°C to an OD 600 of 0.5, and then the pro- 
duction of the TkoGlNS complex was induced by 1 mM 
isopropyl-P-D-thiogalactoside (IPTG), with further culti- 
vation at 25°C for 15 hours. The cells were harvested 
and disrupted by sonication in buffer A (50 mM Tris- 
HC1, pH 8.0, 0.1 mM ethylenediaminetetraacetic acid 
(EDTA), 0.5 mM dithiothreitol (DTT) , and 10% gly- 
cerol). The soluble fraction was collected by centrifuga- 
tion (23 000 x g, 10 minutes) and then was incubated at 
80°C for 20 minutes, to remove the heat-sensitive pro- 
teins derived from the host cells. The supernatant was 
loaded onto a hydrophobic column (HiTrap Phenyl, GE 
Healthcare, Hino, Tokyo, Japan) and developed with a 
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1.0 to 0 M linear gradient of ammonium sulfate. The 
pooled fraction was dialyzed against buffer B (10 mM 
K-phosphate, 7 mM P-mercaptoethanol, 0.05 mM 
CaCl 2 , and 10% glycerol), and was loaded onto a CHT- 
II hydroxyapatite column (Bio-Rad, Shinagawa, Tokyo, 
Japan)), which was developed with a linear gradient of 
0.01 to 1 M K-phosphate. The fraction pool was subse- 
quently dialyzed against buffer C (50 mM Tris-HCl, pH 
8.0, 0.1 mM EDTA, 100 mM NaCl, and 10% glycerol), 
and was loaded onto an anion exchange column 
(HiTrap Q, GE Healthcare). The column was developed 
with a linear gradient of 0.1 to 0.6 M NaCl. The purified 
protein was concentrated to 20 mg/ml for crystalliza- 
tion. To produce the selenomethionine (SeMet) deriva- 
tive of TkoGlNS, the E. coli cells harboring the 
expression plasmids were grown in minimal medium 
containing seleno-L-methionine, at a final concentration 
of 25 (ig/ml. The SeMet protein was expressed and puri- 
fied using the same procedure as for the wild type 
complex. 

The preparation of the recombinant TacGlNS protein 
will be described elsewhere (Ogino et al). Briefly, the 
gene encoding the protein (Tal042) was cloned into the 
pET21a vector, and was overexpressed in E. coli BL21 
codonPlus™ (DE3)-RIL cells. The harvested cells were 
disrupted by sonication, and then the soluble fraction 
was heated at 60°C for 20 minutes. The heat-stable frac- 
tion was treated with polyethyleneimine to remove the 
nucleic acids, and then was subjected to hydrophobic 
(HiTrap Phenyl) and anion exchange (HiTrap Q) col- 
umn chromatographies. 

Crystallization, data collection, and model refinement of 
7/coGINS 

Initial crystallization screening of TkoGlNS was per- 
formed by the hanging-drop vapor diffusion method at 
293 K, using a Mosquito nanodrop dispenser (TTP Lab- 
tech, Cambridge, MA, USA), followed by manual opti- 
mization. In the initial screening, 200 nl aliquots of the 
protein solution were mixed with an equal volume of 
reservoir solutions from commercially available screen- 
ing kits, JCSG Core 1 to 4 (Qiagen, Chuo, Tokyo, 
Japan). TkoGlNS crystallized under nine conditions out 
of the 384 in the semi-automatic setup. An optimization 
procedure yielded the best diffraction quality crystals, 
which were obtained by mixing 1 \A of the protein solu- 
tion with an equal volume of a reservoir, containing 0.1 
M 4-(2-hydroxyethyl)-l-piperazineethanesulfonic acid 
(HEPES) (pH 7.5), 0.2 M magnesium chloride and 24% 
to approximately 26% (v/v) PEG400. The crystals 
belonged to the space group P4 1 2 1 2, with unit cell 
dimensions a = b = 74.3 A, c = 181.3 A, and contained 
one Gins51 subunit and one Gins23 subunit per asym- 
metric unit. The Se-Met protein was crystallized by 



essentially the same procedure as for the wild-type Tko- 
GlNS. Tantalum (Ta 6 Br 14 )- and platinum (K 2 PtCl 4 )- 
derivatized crystals were produced by adding a tiny 
amount of the heavy-atom water solution into the crys- 
tallization drop. Due to the presence of a high concen- 
tration of PEG400 and glycerol in the crystallization 
drop, no additional cryo-protectant was required. Crys- 
tals were picked up from the crystallization drop and 
directly flash-cooled in the nitrogen gas stream at 100 K 
for X-ray diffraction data collection. All data sets were 
collected on BL38-B1 of SPring-8 (Harima, Japan) and 
processed by the HKL2000 package [24]. 

Structure determination by molecular replacement, 
using the human GINS structure, was unsuccessful, 
even though many subunit combinations were tested as 
probes. Therefore, the structure was determined experi- 
mentally by the MIRAS method, using the above three 
derivatives. All of the heavy atom sites were located on 
isomorphous or anomalous Patterson maps, and the 
heavy atom parameters were refined by the program 
SHARP [25]. The phases were improved by density 
modification techniques, with the programs DM and 
SOLOMON in the CCP4 suite [26]. The atomic model 
was built with the program O [27], and the crystallo- 
graphic refinement was performed with the program 
CNS [28]. Manual remodeling and refinement were iter- 
ated until satisfactory convergence was achieved. The 
final atomic model contained residues 1 toll3, 129 to 
186 of Gins51, seven amino acids derived from the N- 
terminal tag, residues 1 to 93, 101 to 131, 134 to 164 of 
Gins23, and 33 water molecules. Residues 114 to 128 
and 187 to 188 of Gins51 and residues 94 to 100 and 
132 to 133 of Gins23 were missing in the electron den- 
sity map. All non-glycine and proline residues are 
located in either the most favored, additionally allowed, 
or generously allowed regions of the Ramachandran plot 
(data not shown). The crystallographic analyses are sum- 
marized in Table 1. The atomic coordinates of TkoGlNS 
have been deposited in the Protein Data Bank, under 
the accession code 3ANW. 

Gel filtration chromatography of the purified 7/coGINS 

Gel filtration chromatography was performed to analyze 
the oligomeric states of the TkoGlNS subunits and the 
complexes, using a SMART system and a Superdex 200 
PC 3.2/30 column (GE Healthcare). The column was 
equilibrated with buffer D (50 mM Tris-HCl, pH 8.0, 
0.15 M NaCl, 7 mM p-mercaptoethanol, 0.1 mM EDTA). 
A 20 \il aliquot of each protein (20 [iM) was applied to 
the column, which was developed with buffer D. Aliquots 
of the fractions were subjected to 12.5% SDS-PAGE ana- 
lysis followed by Coomassie Brilliant Blue staining. A 
Bio-Rad gel filtration standard was used to estimate the 
molecular weights of the GINS complexes. 
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Homology modeling of TacGINS 

The structure-based alignment was made by using the 
program MATRAS [29]. The homology model of Tac- 
GINS was constructed by using the Homology module 
of the MOE application (Ryouka Systems, Inc., Chuo, 
Tokyo, Japan), which was based on the methods of 
Levitt [30] and Fechteler et al [31]. First, the amino 
acid sequence of TacGins (conserved hypothetical pro- 
tein of T. acidophilum; accession code: CAC12170) was 
aligned with 27<oGins51, as shown in Figure 5 A, and a 
homology model was generated for the AB-type subu- 
nits, according to the sequence alignment (Figure 5B). 
In order to model the BA-type subunits, the sequences 
of the N- and C-terminal (A and B) domains of TacGins 
were separately aligned with the C- and N-terminal 
domains of 27<oGins23, respectively, and the domains 
were connected with the ID-loop sequence of TacGins. 

When the subunits of TacGYNS were assembled into a 
tetramer, by superposing them onto each corresponding 
subunit of the TkoGlNS tetramer, the ID-loop of the BA 
subunits exhibited extensive atomic clashes with the 
upper layer subunits. Therefore, the ID-loop was manu- 
ally rebuilt, to avoid atomic clashes. Finally, the TacG INS 
tetramer was energy minimized, in order to remove other 
atomic clashes among the subunits (Figure 5C). 

Electron microscopy and single particle image analysis of 
TacG INS 

The purified TacGlNS complex was suspended in a buf- 
fer containing 50 mM Tris-HCl (pH 8.0) and 40 mM 
NaCl, (final protein concentration: 5 (ig/ml). A 3 [i\ ali- 
quot of the sample solution was applied to a copper grid 
supporting a continuous thin-carbon film, left for 1 min- 
ute, and then stained with three drops of 2% uranyl acet- 
ate. Images of molecules were recorded by an Eagle 2 K 
CCD camera (FEI, Hillsboro, OR, USA) with a pixel size 
of 2.76 A/pixel, using a Tecnai T20 electron microscope 
(FEI) operated at an accelerating voltage of 200 kV. A 
low dose system was used to reduce the electron radia- 
tion damage of the sample. A total of 1,989 images of the 
TacGYNS complex were selected, using the BOXER pro- 
gram in EMAN [32] . The two-dimensional class averages 
were obtained using the refine2d tool of EMAN, assum- 
ing 50 classes. The simulated two-dimensional projection 
maps were calculated from the atomic model of the Tac- 
GINS tetramer, obtained by homology modeling. 

Additional material 



Additional file 1: Structure comparison of the 7/coGINS B domains 
with the C-terminal domain of the primase small subunit (PriS-CTD) 
from Sulfolobus solfataricus (PDB code 1ZT2 chain A). (A) Stereo view 
of the superimposed structures. 77coGins51 B domain is colored green, 
77coGins23 B domain is cyan, and Pris-CTD is pink. The Gins23 B domain 
superimposed on the Gins51 B domain with an RMSD of 0.81 A, using 



the corresponding 38 Ca atoms, and the PriS-CTD superimposed on the 
Gins51 B domain with an RMSD of 0.85 A, using 31 Ca atoms. (B) 
Structure-based sequence alignment. 

Additional file 2: Crystal packing interactions. (A) Overall view of the 
crystal packing. Each tetramer contacts the surrounding four tetramers in 
the crystal with the same interaction mode. (B) Close-up view of the 
packing interaction boxed in (A). A Gins51 B domain contacts a Gins23 B 
domain in the neighboring tetramer. 

Additional file 3: Detailed subunit contacts in the GINS complexes. 

Close-up views of the subunit contacts between Gins51 and Gins23 in 
TtoGINS (A), Sld5 and Psf2 in human GINS (B), and Psf1 and Psf3 in 
human GINS(C) are shown by stereo pairs. Residues involved in the 
contacts are depicted with stick models. (D) to (F) Schematic 
representations of the contacts. 
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MCM: minimichrosome mentenance; Cdc: cell division cycle; Sid: synthetic 
lethal with dpbl 1 ; Psf: partner of Sid 5; Dpb: DNA polymerase B; pre-LC: 
preloading complex; CDK: cyclin-dependent kinase; MIRAS: multiple 
isomorphous replacement method with anomalous dispersion; RMSD: root 
mean square deviation; CTD: C-terminal domain; PriS: primase small subunit; 
ID: intrinsically disordered; Amp: ampicillin; Km: kanamycin; TEV: Tabaco etch 
virus; LB: Luria broth; IPTG: isopropyl-p-D-thiogalactoside; EDTA: 
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hydroxyethyl)-1-piperazineethanesulfonic acid; PEG: polyethylene glycol. 
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